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Improving the Partial-Global Stochastic Metropolis Update for 
Dynamical Smeared Link Fermions 

A. Hasenfratz^ and A. Alexandru^ 

'^Department of Physics, University of Colorado, Campus Box 390, Boulder, CO 80309, USA 

We discuss several methods that improve the partial-global stochastic Metropolis (PGSM) algorithm for 
smeared link staggered fermions. We present autocorrelation time measurements and argue that this update 
is feasible even on reasonably large lattices. 



Smeared link actions have gained popularity 
in recent years. These actions have many de- 
sirable features: improved flavor symmetry with 
staggered fermions, smaller perturbative match- 
ing factors, better topology, etc Q. What hinders 
the use of these smeared actions is the difficulty 
of dynamical simulations with standard molecular 
dynamics type algorithms. Recently it was sug- 
gested that even projected smeared link actions 
can be effectively simulated with partial-global 
heatbath or over-relaxation updates that are com- 
bined with a Metropolis type accept-reject step 
j2|. In this paper we summarize some of the im- 
provements that make this partial-global stochas- 
tic Metropolis (PGSM) update truly effective. 

1. THE SMEARED LINK ACTION AND 
THE PARTIAL-GLOBAL UPDATE 

We consider a smeared link action of the form 



(1) 



where Sg{U) and Sg{V) are gauge actions de- 
pending on the thin links {U} and smeared links 
{V}^ respectively, and Sf is the fermionic ac- 
tion depending on the smeared links only. The 
smeared links are constructed deterministically. 
Here we use HYP blocking Sg{U) is an arbi- 
trary gauge action, and Sg{V) contains 4 and 6- 
link loops of the smeared links. We choose the co- 
efficients of Sg{V) to improve computational effi- 
ciency. The fermionic action describing n f degen- 
erate flavors with staggered fermions is Sf{V) = 



-^indet n(y) with niv) = {m^v)m{v)) 

defined on even sites only. 

We use the partial-global stochastic updating 
(PGSM) algorithm to simulate this system [^. 
The PGSM algorithm is a variation of the pseudo- 
fermion method and satisfies the detailed balance 
equation. First a subset of the thin finks {U} are 
updated and a new thin gauge link configuration 
{[/'} is proposed such that the transition proba- 
bility p{U, U') of the update satisfies detailed bal- 
ance with Sg{U) . The proposed configuration is 
accepted with the probability 



Pace = min{l,e^ 



(2) 



where ASg = Sg{V') - Sg{V), the matrix A = 
fl ~^^^nn and the stochastic vector ^ is gen- 
erated with Gaussian distribution, exp(— ^*^). 

2. IMPROVING THE PGSM UPDATE 

The PGSM algorithm averages the stochastic 
estimator of the fermionic determinant together 
with the gauge configuration ensemble average 
and it can be efficient only if the standard de- 
viation of the stochastic estimator is small. The 
standard deviation of detA~^ can be written as 



= < e 

= dct(2A - 1)-^ - det{A) 



> - < e 

-1 J„+//l\-2 



a A diverges if even one of the eigenvalues of A is 
less than 1/2, which is likely to occur if fl and fi' 
differ significantly We consider two separate 
methods to solve this problem. 
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1. Reduction: The fermionic matrix re- 
duction removes the most UV part of the 
fermionic operator by defining a reduced matrix 
D,r = D,e~^^^^^ and rewriting the fermionic action 
as 

Sf{V) = -^(in det nr{V) + 2Tif{n)y (3) 

If the function / is a polynomial of il, Trf{il) 
can be evaluated exactly and only the determi- 
nant of the reduced matrix fir has to be calcu- 
lated stochastically. The parameters of / are cho- 
sen such as to minimize the eigenvalue spread of 
ilr- With / a cubic polynomial the conditioning 
number of the fermionic matrix can be reduced 
by about a factor of 30. 

2. Determinant breakup: Writing the 
fermionic determinant in the form det A^^ = 
(det A^/"')^"'' suggests that the stochastic part 
of the estimator can be evaluated using nbUf/A 
independent Gaussian vectors with the replace- 
ment 

e-r(^"//*-i)? ^ g-E:=:^''«n^^^"'-i]«.. (4) 

The standard deviation of the stochastic es- 
timator is now finite as long as the ma- 
trix nr^''^{V')nr{V)Vtr'^''^{V') haS no eigenvalue 

smaller than 1/2"'', a condition that can always 
be satisfied Q. 

The effectiveness of the fermionic reduction 
and determinant breakup is illustrated in fig- 
ure where we show the stochastic estimator on 
a specific A matrix using no improvement, with 
fermionic reduction but no determinant breakup, 
and with fermionic reduction and rit, = 8 deter- 
minant breakup. Observe the 16 orders of mag- 
nitude difference between between figure |l|a and 

3. TUNING THE ACTION PARAME- 
TERS 

Improving the stochastic estimator ensures 
that the acceptance rate of the PGSM update is 
close to the theoretical maximum, the exact ratio 
of the determinants. But the acceptance rate will 
be large only if the exact ratio is large. That will 
be the case if the configurations proposed by the 
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Figure 1. The stochastic estimator on a given 
fermionic matrix A with a) no improvement; b) 
fermionic reduction but no determinant breakup; 
c) fermionic reduction and rib = 8 determinant 
breakup. 

pure gauge action Sg{U) are close to the dynam- 
ical configurations. Chosing Sg{U) such that it 
matches the lattice spacing and/or short distance 
fluctuations of the expected dynamical configu- 
rations can maximize the acceptance rate. Such 
a matching requires the shift of the pure gauge 
coupling towards the appropriate quenched value. 
This shift can be compensated by the Sg{V) term. 
Sg{V) is taken into account in the accept-reject 
step but since it depends on the smeared links 
only, it fluctuate much less than Sg{U) and does 
not affect the acceptance rate significantly 

4. THE EFFECTIVENESS OF THE 
PGSM UPDATE 

The effectiveness of the PGSM algorithm de- 
pends on both the partial-global heat bath update 
and the stochastic estimator. The autocorrela- 
tion time of the simulation is the larger of the au- 
tocorrelation times of the heat bath and stochas- 
tic steps. We have measured the integrated auto- 
correlation time of the plaquette as the function 
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Figure 2. Autocorrelation of the Uf = 2 runs. 
Open squares: n?, = 8, am = 0.10, filled squares: 
fib ~ 12, am = 0.10, crosses: ni, — 12, am — 0.04 
runs. The lines corresponds to the expected auto- 
correlation based on the heatbath update alone. 

of the number of links updated at one heat bath 
step, tHB @- The autocorrelation time of the 
pure gauge partial-global heat bath update can 
be expressed as 



'P9 



THB 



tHSra ' 



(5) 



where thb is the autocorrelation time of the pure 
gauge heat bath update, V is the volume of the 
lattice, and Va is the acceptance rate. All our 
simulations were carried out on 8^ x 24 lattices at 
lattice spacing a ~ 0.17 fm with Uf = 2 fermion 
flavors. We considered two quark mass values, 
am ~ 0.10 where mT^/mp ^ 0.71, and am = 0.04 
where m^^/mp ~ 0.54. In figure ^ we plot Tauto 
for the different runs showing the dependence on 
the quark mass, determinant breakup, and tnB- 
We conclude this paper by translating the auto- 
correlation time measurements to computer time 
requirements. In figure || we plot the computer 
time measured in fermionic matrix M'^M multi- 
plies to create configurations that are separated 
by 2rauto update steps. Using thin link fermions 
with standard small step size algorithms, it takes 
about 8 X lO'^ M^M multiplies to create indepen- 
dent configurations with am = 0.10. On these 
10 fm"* lattices it is actually faster to create HYP 
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Figure 3. Cost of creating independent configu- 
rations, measured in fermionic matrix multiplies. 
Notation is the same as in figure ^. 

smeared configurations with PGSM than thin link 
ones with a small step size algorithm. The PGSM 
algorithm scales with the square of the lattice 
volume. To repeat the above measurements on 
100 fm'' lattices would increase the computer time 
by 100. A factor of 10 increase is due to the 
increased volume, and the other factor of 10 is 
due to the increased autocorrelation time. Even 
on 100 fm^ volume the PGSM simulation with 
am — 0.10 will be only a few times slower than 
thin link fermion simulations. Simulation cost 
at smaller lattice spacing but same physical vol- 
ume increases only linearly with the lattice vol- 
ume as the number of links that can be effectively 
touched increases accordingly. 
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